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4 <110> APPLICANT: DIVERSA CORPORATION 

5 LAM, David 

6 MATHUR, Eric 
TITLE OF INVENTION: ENDOGLUCANASES 
FILE REFERENCE: DIVER1150-5 
CURRENT APPLICATION NUMBER: 09/914,543 
CURRENT FILING DATE: 1998-11-22 
PRIOR APPLICATION NUMBER: PCT/US97/08793 
PRIOR FILING DATE: 1997-05-22 
PRIOR APPLICATION NUMBER: US 08/651,572 
PRIOR FILING DATE: 1996-05-22 
NUMBER OF SEQ ID NOS : 52 

SOFTWARE: FastSEQ for Windows Version 4.0 
SEQ ID NO: 1 
LENGTH: 2529 
TYPE : DNA 

ORGANISM: Thermotoga maritim a 



DATE: 04/25/200 
TIME: 11:38:44 




8 <120> 
11 <130> 

13 <140> 

14 <141> 

16 <150> 

17 <151> 

19 <150> 

20 <151> 
22 <160> 
24 <170> 

26 <210> 

27 <211> 

28 <212> 

29 <213> 



~31~22"0'> FEATURE 

32 <221> NAME/KEY: CDS 

33 <222> LOCATION: (1) . . . ( 

34 <223> OTHER INFORMATION 

36 <400> SEQUENCE: 1 

37 atg ata aac gtt gca acg 

38 Met He Asn Val Ala Thr 

39 1 5 

41 gtc aac tgg ttc ggc ttt 
4 2 Val Asn Trp Phe Gly Phe 



2523) 

AEPII la Archaeal Endoglucanase 



gga gag gag 
Gly Glu Glu 



43 



20 



4 5 tgg agt agg aac tgg gag 
4 6 Trp Ser Arg Asn Trp Glu 
35 

ttc aat gcg ata: agg ctt 
Phe Asn Ala He Arg Leu 
50 

53 acg atg cca acg gcg att 

54 Thr Met Pro Thr Ala He 
65 70 



47 
49 
50 
51 



55 



gag 
Glu 

gac 
Asp 

ccc 
Pro 
55 
gac 
Asp 



aca 
Thr 

atg 
Met 
40 
ttc 
Phe 



ccg 
Pro 
25 
etc 
Leu 

tgt 
Cys 



tac gcc 
Tyr Ala 



57 ctt gac age gtc cag ata 



58 
59 



Leu Asp Ser Val Gin He 
85 



61 ctg ggc ata ttc gtg etc 



62 
63 



Leu Gly He Phe Val Leu 
100 



atg gag aaa 
Met Glu Lys 

etc gac tac 
Leu Asp Tyr 
105 



acc cca 
Thr Pro 
10 

aac tac 
Asn Tyr 

etc cag 
Leu Gin 

acc cag 
Thr Gin 

aag aac 
Lys Asn 
75 

ata ate 
He He 
90 

cac aga 
His Arg 



ata cac 
He His 

gtt gtt 
Val Val 

ate aag 
He Lys 
45 

tea gta 
Ser Val 

60 
cca gac 
Pro Asp 



etc ttt 
Leu Phe 
15 

cac ggc 
His Gly 
30 

age ett 
Ser Leu 

aaa ccg 
Lys Pro 

etc cag 
Leu Gin 



aag aag get gga 
Lys Lys Ala Gly 
95 

ata gga tge aac 
He Gly Cys Asn 
110 



gga 
Gly 

eta 
Leu 

ggc 
Gly 

ggg 

Gly 

ggt 
Gly 
80 
gac 
Asp 

ttc 
Phe 



48 



96 



144 



192 



240 



288 



336 
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65 


ata 


gaa 


ccc 


eta 


tgg 


tac 


acc 


gac 


age 


ttc 


teg 


gag 


cag 


gac 


tac 


ata 


384 


66 


He 


Glu 


Pro 


Leu 


Trp 


Tyr 


Thr 


Asp 


Ser 


Phe 


Ser 


Glu 


Gin 


Asp 


Tyr 


He 




67 






115 










120 










125 






69 


aac 


acc 


tgg 


gtt 


gaa 


gtc 


gcc 


cag 


agg 


ttc 


ggc 


aag 


tac 


tgg 


aac 


gtt 


432 


70 


Asn 


Thr 


Trp 


Val 


Glu 


Val 


Ala 


Gin 


Arg 


Phe 


Gly Lys 


Tyr 


Trp 


Asn 


Val 




71 




130 










135 










140 










73 


ate 


ggc 


gcg 


gac 


ctg 


aag 


aac 


gaa 


eee 


cac 


age 


tea 


age 


ccc 


gea 


cet 


480 


74 


He 


Gly Ala 


Asp 


Leu 


Lys 


Asn 


Glu 


Pro 


His 


Ser 


Ser 


Ser 


Pro 


Ala 


Pro 




75 


145 










150 










155 










160 




77 


gee 


gcc 


tac 


act 


gac 


gga 


agt 


ggg 


gee 


acg 


tgg 


gga 


atg 


ggc 


aac 


aac 


528 


78 


Ala 


Ala 


Tyr 


Thr 


Asp 


Gly 


Ser Gly Ala 


Thr 


Trp 


Gly Met 


Gly 


Asn 


Asn 




79 










165 










170 








175 






81 


gcc 


acc 


gac 


tgg 


aac 


ctg 


gcg 


get 


gag 


agg 


ata 


gga 


agg 


gea 


att 


ctg 


576 


82 


Ala 


Thr 


Asp 


Trp 


Asn 


Leu 


Ala 


Ala 


Glu 


Arg 


He 


Gly Arg Ala 


He 


Leu 




83 








180 










185 










190 








85 


gag 


gtt 


gcc 


cca 


caa 


tgg 


gtt 


ata 


ttt 


gtt 


qaq 




acc 


cag 


ttc 


acc 


624 


' 86 


Glu 


Val 


Ala 


Pro 


Gin 


Trp 


Val 


He 


Phe 


Val 


Glu 


Gly 


Thr 


Gin 


Phe 


Thr 




87 






195 










200 








205 










89 


acc 


ccc 


gag 


ata 


gac 


ggt 


agg 


tac 


aag 


tgg 


QQC 


cac 


aac 


gee 


tgg 


tgg 


672 


90 


Thr 


Pro 


Glu 


He 


Asp Gly Arg Tyr Lys 


Trp 


Gly 


His 


Asn 


Ala 


Trp 


Trp 




91 




210 










215 










220 








93 


ggc 


gga 


aac 


ctt 


atg 


ggt 


gtt 


agg 


aag 


tac 


cca 


qtt 


aac 


ctg 


ccc 


agg 


720 


94 


Gly 


Gly 


Asn 


Leu 


Met 


Gly Val 


Arg 


Lys 


Tyr 


Pro 


Val 


Asn 


Leu 


Pro 


Arg 




95 


225 










230 










235 










240 




97 


gac 


aag 


gtt 


gtt 


tac 


age 


ccc 


caa 


gtt 


tac 


ggt 


tea 


gaa 


gtt 


tac 


gac 


768 


98 


Asp 


Lys 


Val 


Val 


Tyr 


Ser 


Pro 


Gin 


val 


Tyr 


Gly 


Ser 


Glu 


Val 


Tyr 


Asp 




99 










245 










250 










255 






101 


cag 


ccc 


tac 


ttt 


gac 


ccc 


ggt 


gag 


ggg 


ttc 


ccc 


gac 


aac 


etc 


ccc 


gaa 


816 


102 


Gin 


Pro 


Tyr 


Phe 


Asp 


Pro Gly Glu 


Gly 


Phe 


Pro 


Asp 


Asn 


Leu 


Pro 


Glu 




103 








260 










265 








270 








105 


ata 


tgg 


tac 


cac 


cac 


ttc 


ggc 


tac 


gta 


aag 


ctt 


gat 


etc 


ggt 


tac 


cet 


864 


106 


He 


Trp 


Tyr 


His 


His 


Phe 


Gly 


Tyr 


Val 


Lys 


Leu 


Asp 


Leu 


Gly 


Tyr 


Pro 




107 






275 










280 










285 






109 


gtt 


gtt 


ata 


ggt 


gag 


ttc 


gga 


ggc 


aag 


tac 


ggc 


cat 


ggg 


gga 


gac 


eeg 


912 


110 


Val 


Val 


He 


Gly Glu 


Phe 


Gly Gly Lys 


Tyr 


Gly 


His 


Gly 


Gly Asp 


Pro 




111 




290 










295 










300 










113 


agg 


gat 


gtc 


act 


tgg 


cag 


aac 


aag 


ata 


ata 


gac 


tgg 


atg 


ate 


cag 


aac 


960 


114 


Arg 


Asp 


Val 


Thr 


Trp 


Gin 


Asn 


Lys 


He 


He 


Asp 


Trp 


Met 


He 


Gin 


Asn 




115 


305 










310 










315 










320 




117 


aaa 


ttc 


tgt 


gac 


ttc 


ttc 


tac 


tgg 


age 


tgg 


aac 


cca 


aac 


age 


ggt 


gac 


1008 


118 


Lys 


Phe 


Cys 


Asp 


Phe 


Phe 


Tyr 


Trp 


Ser 


Trp 


Asn 


Pro 


Asn 


Ser 


Gly Asp 




119 










325 










330 










335 






121 


acc 


ggt 


gga 


att 


ctg 


aag 


gat 


gac 


tgg 


acg 


aca 


ata 


tgg 


gag 


gac 


aag 


1056 


122 


Thr 


Gly 


Gly 


He 


Leu 


Lys 


Asp 


Asp 


Trp 


Thr 


Thr 


He 


Trp 


Glu 


Asp 


Lys 




123 








340 










345 








350 




125 


tac 


aac 


aac 


ctg 


aag 


agg 


etc 


atg 


gac 


age 


tgt 


tct 


gga 


aac 


gee 


act 


1104 


126 


Tyr 


Asn 


Asn 


Leu 


Lys 


Arg 


Leu 


Met 


Asp 


Ser 


Cys 


Ser 


Gly 


Asn 


Ala 


Thr 




127 






355 










360 








365 










129 


gcc 


ccg 


tec 


gtc 


ccc 


acg 


aca 


act 


aca 


aca 


aca 


age 


aca 


eeg 


cca 


acg 


1152 
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130 Ala P.O se. Val P.o T.. ... 

133 "^/j 380 

134 ?hr ?hr Thr tS Thr ?hr Ser ?S P^n ^^"^ ^''^ 

135 385 390 ^^"^ "^^^ ^'^^ Thr Pro 
137 acc acc act act cca act aca ao;, 400 
J3e ... ... ™. - .cc .cc ao, .0. a.. a« 

141 aat aac gtc cca ttt aaa ^-i- 415 

v.. ..e S ^ ^ - - ... 

. j«= iTr 111 m rii til z S r r 

147 435 J»J V«l Cys Asp Gly Thr Gin Cys Ala 

l"o III III '.11 III lly L" til III r T 
■ 151 450 ^ ° III *=" Trp Gly Val val Lys lU Gly 

III ti: s ?s L'? It; III lit ',11 s ir ?' »^ 

155 4 65 ^ 47° ^rp Gly Trp Glu Asp Val Tyr Lys 

157 act gca ccc cag gac att craa sno crrr. '^^^ 480 

■III Ai. p„ Gi. A=p ne gT r.i III III ill i;i '„ii III III III 

162 III III L'^y^ III Ill "0 ate ,.c at, :a\= co, 

163 Trp As„ He As„ „et hi, p^^ 

165 aag tat aac aca ato ona f«„ ™ 510 

-ir ^°^?^-tiiai;ri!nrn;n^ 

1^70 ^?p' G'?y g\'„' til Z 5 r T »• 

171 530 Phe Val Leu Pro He Lys 

S E - i S ?a'^ A- Z B I- L'^ I" 

i - - - - i s g5 La Ill - B 

S ^^.^ S - HI I- - 5 - - - S at, 

1^6 v'a'? IL' p - lit - - - |t cot t,= p aa, 

-■-87 595 ^ Ala Gly Tyr Asp Lys Gly 

189 cca gtt etc acc att a^t r,^^ ^05 

150 Pro val ?a" til a'" tT 

191 610 g^5 ° ^-^^ Val Asp Gly Arg Leu Val 

193 aac caa act t^-r rr=„ „4. ^ ^20 

- »3a Gla' ..r tli IZ lit l^r Z 1^ ^ ^ III If, 1% HI 



1200 
1248 
1296 
1344 
1392 
1440 
1488 
1536 



IfsT 

1632 
1680 
1728 
1776 
1824 
1872 
1920 
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195 625 

]ll 11"" aag aac 

198 Phe Phe Thr Phe Lys Pro Thr Lys Asn 
'^^ 645 

ttc gac tac acc aaa ttc ata gaa ata 
Phe Asp Tyr Thr Lys Phe He Glu He 
660 



199 
201 
202 
203 
205 



665 



on^ ^^"^ ^""^ tac ctg atg 

206 Gly ser Leu Thr Asn His Tyr Leu Met 



680 



209 ata tac acc aac ggg tgc acc tea ttc 



210 
211 
213 
214 



-^v. t^^^ ^^^^CL LUC 

He Tyr Thr Asn Gly Cys Thr Ser Phe 

690 695 
tgg acc ctt gac aag tac agg ttc ate 
91. ^^"^ Tyr Arg Phe He 

9^1 ^'"'^ ^tt etc gtc 

218 Thr Glu Glu Ala Met Arg Val Leu Val 
^-^^ 725 

III T"" tu^ ^""^ ^""^ act 
222 Ser Thr Thr Thr Ser Gin Thr Thr Thr 



223 



740 



635 
tac aac 
Tyr Asn 
650 

gtt gac 
Val Asp 

tec ctg 
Ser Leu 

cea tgc 
Pro Cys 

ctg gee 
Leu Ala 
715 
gga gag 
Gly Glu 
730 

tea acc 
Ser Thr 



ggc tea gag 
Gly Ser Glu 

aac tac etc 
Asn Tyr Leu 
670 

gaa ttc ggt 
Glu Phe Gly 
685 

aca gtg gac 
Thr Val Asp 
700 

cea gga aca 
Pro Gly Thr 

gtc cag cct 
Val Gin Pro 



III ^""^ tea 

226 Thr Thr Thr Thr Thr Thr Gin Thr Ser 
755 

229 ecg eeg aca ace acc gea cct get cag 
230_Pro Pro Thr Thr Thr„Ala„Pro-Ala-Gll 

770 yy5 

233 ccg gac gat ggg cag tgg cec gag gee 
235 7^? """^ 



640 
gtt gtg 
Val Val 
655 

ggc ggt 
Gly Gly 

acc gag 
Thr Glu 

gta agg 
Val Arg 

atg gee 
Met Ala 
720 
ecc get 
Pro Ala 
735 

aeg cec 
Thr Pro 



acc acc 
Thr Thr 



790 



235 785 

237 gga aac cea gag ttc tac ata gaa ata 

238 Gly Asn Pro Glu Phe Tyr He Glu He 
^-^^ 805 

111 ?f ^^"^ ^^"^ ^tg acc tac 

242 Ala Glu Ser Tyr Ala Glu Met Thr Tyr 



820 



246 III iTr ?" "» t,.t„ 



gac gta 
-Asp-Val- 

cea att 
Pro He 
795 
aac ecg 
Asn Pro 
810 

aac ttg 
Asn Leu 



aca ace cea 
Thr Thr Pro 
750 

act aca acc acc tea 
Thr Thr Thr Thr Ser 
765 

att aag etc agg tac 



— H-e~Lys~L"eu 
780 

gac agg gat 
Asp Arg Asp 

tgg aac ata 
Trp Asn He 

age age ggg 
Ser Ser Gly 
830 



Arg Tyr 

gga gac 
Gly Asp 
800 
etg age 
Leu Ser 
815 

gtt etc 
Val Leu 



1968 



2016 



2064 



2112 



2160 



2208 



2256 



2304 



2352 



246 
247 



- -> zf^^ yau ay L ata 

His Tyr Val Gin Ala Leu Asp Ser He 

^35 o^n 

250 <210> SEQ ID NO: 2 

251 <211> LENGTH: 841 

252 <212> TYPE: PRT 

III 2Jf ^^^^NISM: Thermotoga maritima 

255 <400> SEQUENCE: 2 

256 Met He Asn Val Ala Thr Gly Glu Glu 

258 val Asn Trp Phe Gly Phe Glu Thr Pro 

259 20 25 



2400 



2448 



2496 



2529 



Thr Pro He His Leu 
10 

Asn Tyr Val Val His 
30 



Phe Gly 
15 

Gly Leu 
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260 Trp 
261 

262 Phe 
263 

264 Thr 

265 65 

266 Leu 
267 

268 Leu 
269 

270 lie 
271 

272 Asn 
273 

274 He 

275 145 

276 Ala 
277 

278 Ala 
279 

280 Glu 
281 

282 Thr 
283 

284 Gly 

285 225 

286 Asp 
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Ser Arg 
35 

Asn Ala 
50 

Met Pro 

Asp Ser 

Gly He 

Glu Pro 
115 
Thr Trp 
130 

Gly Ala 



Asn Trp Glu 
He Arg Leu 



Thr Ala He 
70 

Val Gin He 
85 

Phe Val Leu 
100 

Leu Trp Tyr 
Val Glu Val 



Asp Met Leu Leu 
40 

Pro Phe Cys Thr 
55 

Asp Tyr Ala Lys 



Ala Tyr 

Thr Asp 

Val Ala 
195 
Pro Glu 
210 

Gly Asn 



— 2-8-7 
288 Gin 
289 

290 He 
291 

292 Val 
293 

294 Arg 

295 305 

296 Lys 
297 

298 Thr 
299 

300 Tyr 
301 

302 Ala 
303 



Lys Val 



Asp Leu Lys 
150 

Thr Asp Gly 

165 
Trp Asn Leu 
180 

Pro Gin Trp 

He Asp Gly 

Leu Met Gly 
230 

Val Tyr Ser 



Met Glu Lys He 
90 

Leu Asp Tyr His 
105 

Thr Asp Ser Phe 
120 

Ala Gin Arg Phe 
135 

Asn Glu Pro His 



Gin He 

Gin Ser 
60 

Asn Pro 
75 

He Lys 
Arg He 
Ser Glu 



Ser Gly Ala Thr 
170 

Ala Ala Glu Arg 
185 

Val He Phe Val 
200 

Arg Tyr Lys Trp 
215 

Val Arg Lys Tyr 



Gly Lys 
140 
Ser Ser 
155 

Trp Gly 



Lys Ser 
45 

Val Lys 

Asp Leu 

Lys Ala 

Gly Cys 
110 
Gin Asp 
125 

Tyr Trp 



Leu Gly 

Pro Gly 

Gin Gly 
80 

Gly Asp 
95 

Asn Phe 
Tyr He 
Asn Val 



Pro Tyr 

Trp Tyr 
275 
Val He 
290 

Asp Val 
Phe Cys 
Gly Gly 



245 
Phe Asp Pro 
260 

His His Phe 



304 Thr 

305 385 

306 Thr 
307 

308 Asn 



Asn Asn 
355 
Pro Ser 
370 
Thr Thr 



Gly Glu Phe 

Thr Trp Gin 
310 

Asp Phe Phe 

325 
He Leu Lys 
340 

Leu Lys Arg 



Thr Thr 
Asn Val 



Val Pro Thr 

Thr Thr Thr 
390 

Thr Pro Thr 

405 
Pro Phe Glu 



Pro Gl n_Va l_Ty:r- 
250 

Gly Glu Gly Phe 
265 

Gly Tyr Val Lys 
280 

Gly Gly Lys Tyr 
295 

Asn Lys He He 

Tyr Trp Ser Trp 
330 

Asp Asp Trp Thr 
345 

Leu Met Asp Ser 
360 

Thr Thr Thr Thr 
375 

Ser Thr Pro Thr 



He Gly 

Glu Gly 

Gly His 
220 
Pro Val 
235 

Gl-y~Ser 



Ser Pro 

Met Gly 

Arg Ala 
190 
Thr Gin 
205 

Asn Ala 
Asn Leu 
Gl'u~Val- 



Ala Pro 
160 
Asn Asn 
175 

He Leu 

Phe Thr 

Trp Trp 

Pro Arg 
240 



Pro Asp 

Leu Asp 

Gly His 
300 
Asp Trp 
315 

Asn Pro 
Thr He 
Cys Ser 



Asn Leu 
270 
Leu Gly 
285 

Gly Gly 



Thr Thr Thr Thr 
410 

He Val Asn Val 



Thr Ser 
380 
Thr Thr 
395 

Thr Thr 



Met He 

Asn Ser 

Trp Glu 
350 
Gly Asn 
365 

Thr Pro 



Thr Gin 
Thr Thr 
Leu Pro Thr Ser 



"TyFlVsp" 
255 

Pro Glu 

Tyr Pro 

Asp Pro 

Gin Asn 
320 
Gly Asp 
335 

Asp Lys 

Ala Thr 

Pro Thr 

Thr Pro 
400 
Pro Ser 
415 

Ser Gin 
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L:371 M:281 W: 
L:374 M:258 W: 
L:2276 M:281 W 
L:2279 M:258 W: 
L:2335 M:281 W: 
L:2338 M:258 W: 
L:2544 M:281 W: 
L:2547 M:258 W: 
L:2753 M:281 W: 
L:2756 M:258 W: 
I':2972 M:281 W 
L:2975 M:258 W: 
L:3188 M:281 W: 
L:3"191 M:258 W: 
L:3409 M:281 W: 
L:3412 M:258 W: 
L:3672 M:281 W: 
l-:3675 M:258 W: 
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Numeric Fields not Ordered, 
Mandatory Feature missing, 
Numeric Fields not Ordered 
Mandatory Feature missing. 
Numeric Fields not Ordered 
Mandatory Feature missing. 
Numeric Fields not Ordered 
Mandatory Feature missing. 
Numeric Fields not Ordered 
Mandatory Feature missing. 
Numeric Fields not Ordered 
Mandatory Feature missing. 
Numeric Fields not Ordered 
Mandatory Feature missing. 
Numeric Fields not Ordered 
Mandatory Feature missing 
Numeric Fields not Ordered, 
Mandatory Feature missing 



<5?oi^f^''^ ^" ascending order! 
<^20> not found for SEQ id# • 3 
, <221> sort in ascending order 1 
<220> not found for SEQ id#:17 

<7lot ^""l ascending order! 
<220> not found for SEQ id#:19 

<220> no^^ i'^/scending order! 
<220> not found for SEQ ID#:21 

'<J2of nn^^ '° ^s^ending order! 
<220> not found for SEQ ID#:23 

'<220>^^''^ " ascending order! 

Ito^^ f """"^ SEQ ID#:25 

'<22oi nT^ " ^^^^nding order! 
<220> not found for SEQ id#:27 

<220> nn^''^ ascending order! 
<220> not found for SEQ ID#:29 

<tlit^ ^""l " ascending order! 
<^20> not found for SEQ ID#-31 
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